Search results for "Motif extraction"

showing 10 items of 11 documents

Discovering representative models in large time series databases

2004

The discovery of frequently occurring patterns in a time series could be important in several application contexts. As an example, the analysis of frequent patterns in biomedical observations could allow to perform diagnosis and/or prognosis. Moreover, the efficient discovery of frequent patterns may play an important role in several data mining tasks such as association rule discovery, clustering and classification. However, in order to identify interesting repetitions, it is necessary to allow errors in the matching patterns; in this context, it is difficult to select one pattern particularly suited to represent the set of similar ones, whereas modelling this set with a single model could…

Association rule learningDiscretizationComputer scienceContext (language use)Correlation and dependencecomputer.software_genreSet (abstract data type)CardinalityKnowledge extractionMotif extraction Pattern discoveryPattern matchingData miningCluster analysisTime complexitycomputer
researchProduct

IP6K gene identification by tag search

2010

Bioinformatics Motif extraction String analysis
researchProduct

IP6K gene identification in plant cells via tag discovery

2010

Bioinformatics Motif extraction String analysis
researchProduct

IP6K gene identification in plant genomes by tag searching

2011

Abstract Background Plants have played a special role in inositol polyphosphate (IP) research since in plant seeds was discovered the first IP, the fully phosphorylated inositol ring of phytic acid (IP6). It is now known that phytic acid is further metabolized by the IP6 Kinases (IP6Ks) to generate IP containing pyro-phosphate moiety. The IP6K are evolutionary conserved enzymes identified in several mammalian, fungi and amoebae species. Although IP6K has not yet been identified in plant chromosomes, there are many clues suggesting its presences in vegetal cells. Results In this paper we propose a new approach to search for the plant IP6K gene, that lead to the identification in plant genome…

Bioinformatics Motif extraction String analysisGeneticsMitochondrial DNAOryza sativaNuclear genebiologyNucleic acid sequencefood and beveragesChromosomeGeneral Medicinebiology.organism_classificationGenomeGeneral Biochemistry Genetics and Molecular BiologyProceedingsArabidopsis thalianaGeneBMC Proceedings
researchProduct

Searching for repetitions in biological networks: methods, resources and tools

2013

We present here a compact overview of the data, models and methods proposed for the analysis of biological networks based on the search for significant repetitions. In particular, we concentrate on three problems widely studied in the literature: ‘network alignment’, ‘network querying’ and ‘network motif extraction’. We provide (i) details of the experimental techniques used to obtain the main types of interaction data, (ii) descriptions of the models and approaches introduced to solve such problems and (iii) pointers to both the available databases and software tools. The intent is to lay out a useful roadmap for identifying suitable strategies to analyse cellular data, possibly based on t…

Cellular datanetwork global alignmentnetwork local alignmentbiological networks analysiSettore INF/01 - Informaticabusiness.industryComputer sciencenetwork queryingComputational Biologynetwork motif extractionModels Theoreticalcomputer.software_genreData typeNetwork motifSoftwareNetwork alignmentData miningbusinessMolecular Biologycomputerasymmetric alignmentBiological networkSoftwareInformation Systems
researchProduct

Characterization and Extraction of Irredundant Tandem Motifs

2012

We address the problem of extracting pairs of subwords (m1,m2) from a text string s of length n, such that, given also an integer constant d in input, m1 and m2 occur in tandem within a maximum distance of d symbols in s. The main effort of this work is to eliminate the possible redundancy from the candidate set of the so found tandem motifs. To this aim, we first introduce the concept of maximality, characterized by four specific conditions, that we show to be not deducible by the corresponding notion of maximality already defined for "simple" (i.e., non tandem) motifs. Then, we further eliminate the remaining redundancy by defining the concept of irredundancy for tandem motifs. We prove t…

Discrete mathematicsRedundancy (information theory)TandemMotif extraction Pattern discoveryText stringLinear numberMathematics
researchProduct

Motif patterns in 2D

2008

AbstractMotif patterns consisting of sequences of intermixed solid and don’t-care characters have been introduced and studied in connection with pattern discovery problems of computational biology and other domains. In order to alleviate the exponential growth of such motifs, notions of maximal saturation and irredundancy have been formulated, whereby more or less compact subsets of the set of all motifs can be extracted, that are capable of expressing all others by suitable combinations. In this paper, we introduce the notion of maximal irredundant motifs in a two-dimensional array and develop initial properties and a combinatorial argument that poses a linear bound on the total number of …

General Computer SciencePattern discoveryTheoretical Computer ScienceCombinatoricsExponential growthMotif extraction Pattern discovery 2D MotifsMotif2D irredundant motifsMotif (music)Pattern matchingRemainderPattern matchingDesign and analysis of algorithmsMathematicsComputer Science(all)Theoretical Computer Science
researchProduct

Flexible pattern discovery with (extended) disjunctive logic programming

2005

The post-genomic era showed up a wide range of new challenging issues for the areas of knowledge discovery and intelligent information management. Among them, the discovery of complex pattern repetitions in string databases plays an important role, specifically in those contexts where even what are to be considered the interesting pattern classes is unknown. This paper provides a contribution in this precise setting, proposing a novel approach, based on disjunctive logic programming extended with several advanced features, for discovering interesting pattern classes from a given data set.

Information managementRange (mathematics)Knowledge extractionbusiness.industryComputer scienceLogical programmingDisjunctive programmingInformation systemMotif extraction Pattern discoveryArtificial intelligenceLevenshtein distancebusinessK-optimal pattern discovery
researchProduct

Derivazione Efficiente di Pattern Strutturati Frequenti da Database di Natura Biologica

2004

Motif extraction Pattern discovery
researchProduct

Optimal extraction of motif patterns in 2D

2009

The combinatorial explosion of motif patterns occurring in 1D and 2D arrays leads to the consideration of special classes of motifs growing linearly with the size of the input array. Such motifs, called irredundant motifs, are able to succinctly represent all of the other motifs occurring in the same array within reasonable time and space bounds. In previous work irredundant motifs were extracted from 2D arrays in O (N 2 log 2 n log log n) and O (N 3) time, where N is the size of the 2D input array and n is its largest dimension. In this paper, we present an algorithm to extract irredundant motifs from 2D arrays that is quadratic in the size of the input. The input is defined on a binary al…

Motif extraction Pattern discovery
researchProduct